# ViT (Vision Transformer)
Screenshots Detection To Classification
Apache-2.0
A ViT-based image classification model for detecting and classifying screenshots
Image Classification
Transformers

al-css
78
2
Pneumonia Model
A deep learning model based on the ViT architecture for identifying signs of pneumonia in chest X-ray images
Image Classification
Transformers

Borjamg
25
1
Facial Age Image Detection
Apache-2.0
A model based on the Vision Transformer (ViT) architecture, trained to predict age ranges from facial images
Face-related
Transformers

dima806
768
11
Vit Base Patch16 224 In21k Face Recognition
Apache-2.0
A face recognition model based on Google's ViT architecture, fine-tuned on an image folder dataset and achieving near-perfect accuracy on the evaluation set.
Face-related
Transformers

jayanta
216
12
Facial Emotions Image Detection
Apache-2.0
A facial emotion recognition model fine-tuned from Google's ViT-base model, achieving 91% accuracy on the test set.
Face-related
Transformers

dima806
198.83k
81
Medicinal Plants Image Detection
Apache-2.0
A Vision Transformer (ViT)-based image classification model for Indian medicinal plant leaves, capable of accurately identifying over 50 types of traditional Indian medicinal plants.
Image Classification
Transformers

dima806
627
7
Vit Base Patch16 224 In21k Weather Images Classification
Apache-2.0
A weather image classification model based on Vision Transformer architecture, fine-tuned on the Kaggle weather dataset with an accuracy of 93.4%
Image Classification
Transformers English

DunnBC22
236
2
Vit Base Patch16 224 Album Vitvmmrdb Make Model Album Pred
Apache-2.0
An image classification model fine-tuned from Google's ViT model on an unspecified dataset
Image Classification
Transformers

venetis
33
0
Vit Face Expression
Apache-2.0
A facial emotion recognition model fine-tuned from a Vision Transformer (ViT), supporting 7 expression classes
Face-related
Transformers

trpakov
9.2M
66
Vit Base Patch16 224 Finetuned Imageclassification
Apache-2.0
An image classification model fine-tuned from Google's ViT model on an image folder dataset, achieving 95.02% accuracy
Image Classification
Transformers

thaonguyen274
13
0
Stanford Car Vit Patch16
Apache-2.0
An image classification model based on the Vision Transformer (ViT) architecture, fine-tuned on the Stanford Cars dataset for fine-grained classification of 196 car models.
Image Classification
Transformers

therealcyberlord
665
5
Dog Food Vit Base Patch16 224 In21k
An image classification model based on the Vision Transformer (ViT) architecture, designed to distinguish between images of dogs and images of food.
Image Classification
Transformers

sasha
32
0
Rock Challenge ViT Two By Two
An image classification model based on the ViT architecture, designed for rock particle classification and achieving an accuracy of 96.6%.
Image Classification
Transformers

dimbyTa
15
0